Evaluation of model-based feature enhancement on the AURORA-4 task
نویسندگان
چکیده
In this paper we focus on the challenging task of noise robustness for large vocabulary Continuous Speech Recognition (LVCSR) systems in non-stationary noise environments. We have extended our Model-Based Feature Enhancement (MBFE) algorithm – that we earlier successfully applied to small vocabulary CSR in the AURORA-2 framework – to cope with the new demands that are imposed by the large vocabulary size in the AURORA-4 task. To incorporate a priori knowledge of the background noise, we combine scalable Hidden Markov Models (HMMs) of the cepstral feature vectors of both clean speech and noise, using a Vector Taylor Series approximation in the power spectral domain. Then, a global MMSE-estimate of the clean speech is calculated based on this combined HMM. This technique is easily embeddable in the feature extraction module of a recogniser and is intrinsically suited for the removal of non-stationary additive noise. Our approach is validated on the AURORA-4 task, revealing a significant gain in noise robustness over the baseline.
منابع مشابه
Robust feature extraction based on an asymmetric level-dependent auditory filterbank and a subband spectrum enhancement technique
In this paper we introduce a robust feature extractor, dubbed as robust compressive gammachirp filterbank cepstral coefficients (RCGCC), based on an asymmetric and level-dependent compressive gammachirp filterbank and a sigmoid shape weighting rule for the enhancement of speech spectra in the auditory domain. The goal of this work is to improve the robustness of speech recognition systems in ad...
متن کاملThe Effect of Different Task Types on Learning Prepositions in Form–Focused and Meaning–Focused Interaction Enhancement-Based Classes
The current study examines the impact of different task types on learning prepositions in form and meaning- focused interaction enhancement- based classes. The participants were 57 second Year University students enrolled in three intact lab classes at Tabriz Islamic Azad University. The first group was provided with form-focused interaction enhancement, the second with the meaning-focused int...
متن کاملFeature extraction and model-based noise compensation for noisy speech recognition evaluated on AURORA 2 task
We have evaluated several feature-based and a model-based method for robust speech recognition in noise. The evaluation was performed on Aurora 2 task. We show that after a subband based spectral subtraction, features can be more robust to additive noise. We also report a robust feature set derived from differential power spectrum (DPS), which is not only robust to additive noise, but also robu...
متن کاملRegularized MVDR spectrum estimation-based robust feature extractors for speech recognition
In this paper, we present two robust feature extractors that use a regularized minimum variance distortionless response (RMVDR) spectrum estimator instead of the discrete Fourier transform-based direct spectrum estimator, used in many front-ends including the conventional MFCC, for estimating the speech power spectrum. Direct spectrum estimators, e.g., single tapered periodogram, have high vari...
متن کاملTowards High-Accuracy Low-Cost Noisy Robust Speech Recognition Exploiting Structured Model
It is well known that the distorted speech can be considered generated from the clean speech with the additive noise and the convolutive channel as In this paper, we present our recent study on using this structured model of physical distortion for robust automatic speech recognition. Three methods are introduced for joint compensation of additive and convolutive distortions (JAC), with diffe...
متن کامل